Wide-Coverage Lexicalized Grammars

Authors

  • Cristina Barbero
  • Vincenzo Lombardo
Abstract

This paper proposes a hierarchical organization of linguistic knowledge that views grammar as an abstraction of item-dependent information (in particular, an abstraction of subcategorization frames into a hierarchy of classes). The formalism has been successfully applied to a classification of 105 Italian verbal frames, developed by analysing a corpus of about 500,000 words. The proposed framework (expressed in a dependency approach) is of linguistic and computational interest. From a linguistic point of view, it is a clear, significant and non-redundant representation. From a computational point of view, structuring the grammar into a hierarchy makes it possible to define a predictive component for parsing that exploits the information at many levels of the hierarchy; this reduces ambiguity, a major problem in large-scale NLP systems.
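
The idea of abstracting subcategorization frames into a hierarchy of classes can be illustrated with a minimal Python sketch. It is not the authors' formalism: the class names (`verb`, `transitive`, `ditransitive`), the dependency labels (`subj`, `obj`, `iobj`), the three example verbs, and the `predicted_dependents` helper are all assumptions made for illustration and are not taken from the 105 frames described in the paper. The sketch shows how a frame can be stored non-redundantly (each class lists only the dependents it adds) and how information shared at a higher level of the hierarchy could be used to predict dependents before the exact class of an ambiguous verb is known.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class FrameClass:
    """A node in a hypothetical hierarchy of subcategorization classes."""
    name: str
    parent: Optional["FrameClass"] = None
    # Dependency relations introduced at this level only (assumed labels).
    dependents: frozenset = frozenset()

    def frame(self) -> frozenset:
        """Full frame: local dependents plus those inherited from ancestors."""
        inherited = self.parent.frame() if self.parent else frozenset()
        return inherited | self.dependents


# Illustrative hierarchy; each class states only what it adds to its parent.
VERB = FrameClass("verb", dependents=frozenset({"subj"}))
TRANSITIVE = FrameClass("transitive", VERB, frozenset({"obj"}))
DITRANSITIVE = FrameClass("ditransitive", TRANSITIVE, frozenset({"iobj"}))

# Toy lexicon mapping Italian verbs to classes (assumed assignments).
LEXICON = {"dormire": VERB, "mangiare": TRANSITIVE, "dare": DITRANSITIVE}


def predicted_dependents(candidates: Iterable[FrameClass]) -> frozenset:
    """Dependents shared by every candidate class (non-empty input expected).

    These can be predicted safely even while the class is still ambiguous.
    """
    frames = [c.frame() for c in candidates]
    return frozenset.intersection(*frames)


if __name__ == "__main__":
    print(sorted(LEXICON["dare"].frame()))
    print(sorted(predicted_dependents([TRANSITIVE, DITRANSITIVE])))
```

Storing only the local dependents at each node keeps the representation free of repetition, and taking the intersection over candidate classes is one simple way to exploit the upper levels of such a hierarchy for prediction; the paper's actual predictive component may work differently.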

Similar resources

A Python-based Interface for Wide Coverage Lexicalized Tree-adjoining Grammars

This paper describes the design and implementation of a Python-based interface for wide coverage Lexicalized Tree-adjoining Grammars. The grammars are part of the XTAGGrammar project at the University of Pennsylvania, which were hand-written and semi-automatically curated to parse real-world corpora. We provide an interface to the wide coverage English and Korean XTAG grammars. Each XTAG gramma...

Automaton-based Parsing for Lexicalized Grammars

In wide-coverage lexicalized grammars many of the elementary structures have substructures in common. This means that during parsing some of the computation associated with different structures is duplicated. This paper explores ways in which the grammar can be precompiled into finite-state automata so that some of this shared structure results in shared computation at run-time.

Extraction of Tree Adjoining Grammars from a Treebank for Korean

We present the implementation of a system which extracts not only lexicalized grammars but also feature-based lexicalized grammars from the Korean Sejong Treebank. We report on some practical experiments where we extract TAG grammars and tree schemata. Above all, the full-scale syntactic tags and well-formed morphological analysis in the Sejong Treebank allow us to extract syntactic features. In addition, ...

Lexicalization and Grammar Development

In this paper we present a fully lexicalized grammar formalism as a particularly attractive framework for the specification of natural language grammars. We discuss in detail Feature-based, Lexicalized Tree Adjoining Grammars (FB-LTAGs), a representative of the class of lexicalized grammars. We illustrate the advantages of lexicalized grammars in various contexts of natural language processing,...

Categorial Dependency Grammars: from Theory to Large Scale Grammars

Categorial Dependency Grammars (CDG) generate unlimited projective and non-projective dependency structures, are completely lexicalized, and can be analyzed in polynomial time. We present an extension of CDG, also analyzable in polynomial time and designed for large-scale dependency grammars. We define for the extended CDG a specific method of “Structural Bootstrapping” consisting in incremental con...

Publication date: 1997